WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
H03G 3/00 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 99/33173 

July 1999 (01.07.99) 



(21) International Application Number: PCT/US98/27174 

(22) International Filing Date: 21 December 1998 (21.12.98) 



(30) Priority Data: 
60/068 J 16 
Not furnished 



23 December 1 997 (23. 1 2.97) US 
2 1 December 1 998 (2 1 . 1 2.98) US 



(71) Applicant: LEXICON [US/US]; 3 Oak Park, Bedford, MA 

01730 (US). 

(72) Inventor: GRIESINGER, David; 23 Bellevue Avenue, Cam- 

bridge, MA 02140 (US). 

(74) Agents: MCCOMBS, David, L. et al.; Haynes and Boone, 
L.L.P., 3100 NationsBank Plaza, 901 Main Street, Dallas, 
TX 75202-3789 (US). 



(81) Designated States: AL. AM, AT, AU, AZ, BA, BB, BG, BR 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GD, 
GE, GH, GM, HR, HU, ID, 1L, IN, IS, JP, KE, KG, KP, 
KR. KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG. MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZW, 
ARIPO patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, 
TD, TG). 



Published 

With international search report. 



(54) Title: METHOD AND SYSTEM FOR DRIVING SPEAKERS WITH A 90 DEGREE PHASE SHIFT 

(57) Abstract 

A method and system for providing enhanced cou- 
pling of a stereophonic pair of full-range, loudspeakers (5, 
6) or subwoofers (8, 9) to ail room modes of a laterally 
symmetric listening room wherein the preferred location of 
a listener is on the lateral center line thereof, by applying 
the left and right audio signals (30, 32) or the low frequency 
components thereof through a pair of all-pass phase shift 
networks having an in-phase relationship at low audio fre- 
quencies where localization is not possible. The method 
and system may further be compensated by a 3 dB bass 
boost in each channel to provide the same sound pressure 
level at low frequencies as would occur with a pair of sub- 
woofers (8, 9) placed together in one corner of the room 
and driven in phase, thereby ensuring a constant sound 
pressure level along the lateral center line of the listening 
room at all audio frequencies. 
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METHOD AND SYSTEM FOR DRIVING SPEAKERS WITH A 90 DEGREE PHASE 

SHIFT 

Cross-Reference to Related Application 

This application is based upon the U.S. Provisional Patent Application No. 60/068,716, 
entitled "Method and System for Driving Speakers With a 90 Degree Phase Shift," filed 
December 23, 1997. 
Field of the Invention 

This invention relates to a method and system of driving a pair of stereo loudspeakers, or a 
pair of loudspeakers that are part of a multiple loudspeaker array, with a non-zero optimal phase 
relationship in the low frequency region, approximating 90 degrees in the region where 
localization becomes impossible. In particular, this method and system allows room modes to be 
excited independently by both loudspeakers thereby producing a full bass response which may be 
further modified by a shelf filter to equalize the sound pressure level in the bass frequency range. 
Background of the Invention 

Stereophonic pairs of loudspeakers are used for sound reproduction in a listening room, and 
are often called upon to reproduce sound in the low frequency region where localization becomes 
impossible. Such loudspeakers are commonly used in pairs or multiples, and frequently are in 
combination with other loudspeakers intended to reproduce the higher audio frequencies. 

In some sound systems, known as satellite-subwoofer systems, the upper frequency range is 
reproduced by a pair of satellite loudspeakers with a limited response below about 200Hz, the low 
frequency signals being combined into a single monophonic subwoofer, often placed along the 
center line of the room. For such systems, this invention has no application. However, when full- 
range loudspeakers are used, or when a user installs a pair of subwoofers, the present invention, as 
described in detail below, can provide better performance than the conventional usage wherein the 
loudspeakers are driven in phase at low frequencies. 

When two speakers are driven in phase the performance of full-range speakers or dual 
subwoofers can be inferior to that of the satellite-subwoofer system. The satellite-subwoofer 
system will give identical results when the subwoofer is located along the center line of the room, 
but more bass (and a more spacious sound) can be obtained if the subwoofer is placed off axis and 
near a corner of the room. With two subwoofers driven in phase, the best placement is to put both 
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together in the same corner of the room. 

In the conventional methods of sound reproduction, a coherent phase relationship between 
the two loudspeakers has been considered necessary so that "pair-wise mixing" and the use of 
pan-pots in recording stereophonically will result in correct imaging of the reproduced sounds. 
5 There is a large body of literature on the merits of phase coherence in stereophonic reproduction 
systems, and great pains are taken, not only to match the amplitude and phase response of 
amplifiers and equalizers, but also to match pairs of loudspeakers. 

Nevertheless, much of the literature also asserts that below some low frequency, usually 
estimated as between 80Hz and 200Hz, localization of sounds is not possible. Above this 
10 frequency the amplitude difference between the loudspeakers is the primary localization cue, but 
along the centerline of a relatively absorptive room the phase relationship also has an effect. 

A neglected consequence of these assertions is that there must be some frequency below 
which the speakers need not be in phase. The question arises of what is the optimal phase 
relationship below this frequency. Research briefly described herein has indicated that there is a 
1 5 non-zero optimal phase relationship, and that the loudspeakers should ideally have a phase 
difference of 90 degrees in the low frequency region. 
Summary of the Invention 

The present invention is concerned with a method and system for driving a pair of 
loudspeakers in an appropriate phase relationship in the low frequency region, by means of a phase 
20 shifting and equalizing circuit. 

One embodiment includes a pair of all-pass phase difference networks optimally producing 
a quadrature relationship in the low frequency region below about 200Hz, while maintaining an in- 
phase relationship at higher frequencies, and additionally includes equalizing networks to 
introduce a small bass boost to compensate for the ~3dB reduction in bass energy along the center 
25 line of the room that the quadrature circuit often produces. 

In another aspect, an electronic signal processing system is provided for driving first and 
second power amplifiers and first and second loudspeaker systems in a listening room. First and 
second input terminals are adapted to receive a stereophonic pair of audio input signals. 
Corresponding first and second output terminals produce a stereophonic pair of output signals for 
30 driving the first and second power amplifiers and the first and second loudspeaker systems. 
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Circuitry is provided for varying the phase relationship between said first and second output 
signals such that their phase difference tends towards zero at high audio frequencies and increases 
to approximately quadrature or 90 degrees phase difference at low frequencies, the gain of the 
circuitry between each said input terminal and said corresponding output terminal being 
approximately constant at all audio frequencies, thereby providing increased apparent 
spaciousness of low frequency sounds reproduced by said loudspeakers in said listening room. 

In another aspect, the signal processing system further comprises a circuit between each 
input terminal and said corresponding output terminal wherein there is an equal bass boost applied 
to both said first and second output signals of approximately 3dB for those frequencies where the 
phase difference is approximately 90 degrees thereby maintaining approximately constant sound 
pressure at all frequencies along the lateral center line from front to back of said listening room. 

In another aspect, the phase difference of approximately 90 degrees at low frequencies is 
obtained without active electronics, by combining differences in the loudspeakers themselves, 
such as differences in enclosure volume, port area, cone mass, surround stiffness, and crossover 
components. 

Although research has shown that relatively steady low frequency sounds are difficult to 
localize, percussive sounds include higher frequency components that are easier to localize. The 
ability to localize percussive sounds increases as the frequency rises. The addition of a 90 degree 
phase shift as provided by this invention sometimes alters the localization of all sounds, and in 
particular percussive sounds. The 90 degree phase shift results in a time delay in the lagging phase 
channel, which tends to produce localization shifting towards the leading phase channel. Thus the 
shift in localization will determine the highest frequency at which the 90 degree phase shift should 
be applied. For reproduction of classical music, where bass instruments are mostly located on the 
right of the orchestra as heard by the listener, the leading phase shift should be applied to the right 
channel to minimize the perception of this effect. However, popular music is typically mixed so 
that the kick drum (or similar instruments) produces equal levels in both channels, which results in 
a strongly percussive low frequency sound. When such recordings are played in a laterally 
symmetric room with phase coherent loudspeakers, the result is a strong in-head localization of the 
low frequency energy. While this sounds unnatural, it is often considered desirable by recording 
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engineers as evidencing good imaging. The present invention, by reducing in-head localization, 
may make some of these engineers unhappy. In order to make them happy again, an additional 
aspect of this invention is additional circuitry that detects rapidly rising (percussive) sound, and 
temporarily reduces the phase shift. 

An advantage of the invention is to provide a richer, fuller bass sound by exciting more 
room modes. It has been shown that in a laterally symmetric room, the use of a single non-central 
loudspeaker can excite both odd and even lateral room modes, while a centrally placed 
loudspeaker cannot excite the odd modes, and the same is true of two loudspeakers driven in phase 
and symmetrically placed. The sense of spaciousness in the sound reproduction is enhanced, 
particularly in smaller listening rooms, by exciting both the odd and even lateral modes. 

Another advantage of the invention is that it reduces unnatural in-head localization of 
certain low frequency sounds. 

A further advantage of the invention is that it enhances the bass response perceived by the 
listener, and provides an optimal phase relationship between the loudspeakers in the low bass 
region. 

An additional advantage of the invention is that it enhances the spaciousness perceived by 
the listener for low frequency sounds, while maintaining the sound pressure level the same as for 
two subwoofers in the same corner of the room. 
Brief Description of the Drawings 

The novel features believed characteristic of the present invention are set forth in the 
appended claims. The invention itself, as well as other features and advantages thereof, will 
best be understood by reference to the following detailed description of an illustrative 
embodiment when read in conjunction with the accompanying drawing figures, wherein: 

FIG. 1 shows a representation of a typical stereophonic loudspeaker system in a listening 

room; 

FIG. 2 illustrates the perception of low frequency spaciousness caused after a direct 
sound decays in a large room, due to background spatial impression (BSI) created by 
reverberant sounds; 

FIG. 3 show the effect of interfering delayed lateral sound reflections in a reverberant 
room causing localization fluctuations, as a function of angle from the front, which results in an 
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impression of spaciousness; 

FIG. 4 illustrates the variation in interaural time differences (ITDs) from interference 
between an asymmetric lateral mode and a medial mode with a 90 degree phase shift; 

FIG. 5 shows the absolute ITD and the net ITD for a typical listening room with a wall 
5 reflectivity of 0.9 as a function of lateral listener position, using a single loudspeaker; 

FIG. 6 shows the absolute and net ITDs when the side wall reflectivity is reduced to 0.7, 
as a function of lateral listener position; 

FIG. 7 shows the absolute and net ITDs in the same room as for FIG. 5 when the front 
and back wall reflectivity is reduced to 0.6, keeping the side wall reflectivity at 0.9, as a 
1 0 function of lateral listener position; 

FIG. 8 shows the absolute and net ITDs in a room as a function of its width, with a 
constant length and for a central listener position; 

FIG. 9 shows the absolute and net ITDs for a room with larger lateral dimensions as a 
function of room width; 

15 FIG. 10 shows the absolute and net ITDs in the same size room as in FIG. 5, produced by 

two spaced loudspeakers, as a function of lateral listener position, when the speakers are driven 
in phase; 

FIG. 1 1 shows the absolute and net ITDs in the same room but with the two speakers 
driven in opposite polarity, as a function of lateral listener position, showing extreme phasyness; 
20 FIG. 12 illustrates a method for computing the impulse response of a room for each ear, 

as a function of listener and speaker positions, wall reflectivities, and image locations; 

FIG. 13 shows a typical impulse response calculated for the left ear and the right ear; 
FIG. 14 shows the ITD=s and phase difference between the left and right ears as a 
function of frequency, calculated from the FFT of the impulse responses; 
25 FIG. 15 shows a detailed schematic of a preferred embodiment of the present invention; 

and 

FIG. 16 shows the absolute gain (equal in both channels) and the phase difference 
between the left and right channel outputs of the circuit of FIG. 15 when the two inputs are 
driven in common, as a function of frequency. 
30 Detailed Description of Exemplary Embodiments 
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Referring to FIGs. 1 and 15, for example, the system and method of the present invention 
is the result of consideration of various acoustical phenomena which may be described in 
relation to the listening room and typical audio installation shown in FIG. 1 . 

The listening room 1 of FIG. 1 shows a listener 2 seated in a preferred listening position 
5 3, often on the lateral center line of the room 4. A left speaker 5 and a right speaker 6 are placed 
symmetrically in relation to the center line 4, so that the lines joining them to the listener 2 
make equal angles q with the center line. 

In many such listening rooms, the loudspeakers 5 and 6 are full-range loudspeakers, but 
in some systems, they only reproduce the higher audio frequencies, while the bass frequencies 
10 are either combined and fed to a single subwoofer 7 or else are fed separately to two subwoofers 
8 and 9 at left and right of the center respectively. The satellite-subwoofer system using a single 
subwoofer 7 is not the type of system to which this invention can be applied. Although the 
subwoofer 7, shown dotted, is here placed on the center line, it may be advantageously placed 
off-center (and preferably towards the right) as will be shown later. Both subwoofers 8 and 9 in 
1 5 FIG. 1 may alternatively be placed in the same corner of the room. One of the purposes of this 
invention is to make such a placement disadvantageous. 

Systems employing either full-range loudspeakers 5, 6, or the combination of satellites 5, 
6, with corresponding subwoofers 8, 9, may provide more spacious and natural sound 
reproduction using the principles embodied in the present invention. 
20 It will be helpful to the background of this invention to investigate the perception of 

spaciousness in large and small rooms, particularly at low frequencies. In particular, the 
inventor refers to the following technical papers, incorporated by reference in their entirety, in 
which he has described such research: 

1 . Griesinger, D, " Measures of Spatial Impression and Reverberance based on the 

25 Physiology of Human Hearing," Proceedings of the 1 1th International Audio Engineering Society 
(AES) Conference, May 1992, pp. 1 14-145. 

2. Griesinger, D, « Spaciousness and Envelopment in Musical Acoustics," AES preprint No. 
4401, presented at the 101st AES Convention, November 8-1 1, 1996. 

3. Griesinger, D, "The Psychoacoustics of Apparent Source Width, Spaciousness and 
30 Envelopment in Performance Spaces," Acta AcusticaNo. 4, pp. 605-746 July/August 1997. 
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4. Griesinger, D, "Spatial Impression and Envelopment in Small Rooms," Audio 
Engineering Society preprint No. 4638, 103 rd Convention, New York September 26-29, 1997. 

5. Griesinger, D, "Speaker Placement, Externalization, and Envelopment in Home Listening 
Rooms," Audio Engineering Society preprint No. 4860, 105th Convention, San Francisco, 
October 1998. 

FIG. 2, taken from reference [4], illustrates the phenomenon called background spatial 
impression, or BSI. 

Sounds received at the left and right ear canals are detected and separated into multiple 
foreground streams and a background stream. The foreground streams comprise sound events 
having a similar timbre, localization, or meaning, such as speech from a particular speaker, or 
sounds from a specific musical instrument. Sounds which do not have the coherence that 
identifies a foreground stream are perceived as a single background stream. It is the spatial 
properties of this background stream which primarily give rise to the perception of spaciousness 
and envelopment. Where it is not possible to separate a background stream, fluctuations in 
interaural time and intensity differences due to delayed lateral reflections create an impression of 
spaciousness. 

The theory of interaural fluctuations and spaciousness perception has been presented at 
length elsewhere [1, 2, 3]. For the sake of clarity it will be briefly summarized here. Horizontal 
localization in human hearing is primarily determined by the interaural time differences (ITDs) 
and intensity differences (IIDs). Delayed lateral reflections interfere with the direct sound and 
cause the IIDs and ITDs to fluctuate. The amplitude of the fluctuations is on the average 
proportional to the energy of the reflection. The rate of the fluctuation depends on many factors, 
among them the bandwidth or frequency vibrato of the musical signal. Fluctuations lower than 
about 3Hz are perceived as source motion, and fluctuations faster than this cause either a 
broadening of the width of the source, or the perception of a narrow source in the presence of a 
surround. 

When the direct sound is continuous and cannot be split into separate sound events through 
variations in amplitude or pitch, reflected energy and the corresponding interaural fluctuations 
give rise to a sense of envelopment or spaciousness that is perceived as connected to or associated 
with the direct sound. The surround impression can be fully enveloping c fully surrounding the 
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listener. However as the loudness of the sound source increases, the strength of the surround 
impression does not change. Changing the direct to reverberant ratio changes the strength of the 
impression, but changing the overall level does not. We call this form of spatial impression CSI, 
or continuous spatial impression. 
5 When the incoming sound consists of separable sound events that can be formed into a 

foreground stream, the perception of spaciousness splits into two different forms depending on the 
delay of the reflections. Reflected energy that arrives during a sound event, and within 50ms of 
the end of the sound event, is perceived as part of the sound event itself. We call this spatial 
impression ESI, or early spatial impression. ESI, like CSI, does not change as the strength of the 

1 0 source is varied. However, unlike CSI, ESI is usually not fully enveloping. ESI is perceived as 
strongly occupying the same direction as the source sound. 

ESI is the spatial impression of small rooms. We are not usually aware of ESI, but one can 
train oneself to hear it. When you listen to another person talk in a small room you can be very 
much aware that you are in a small room. However, the sound of the room is perceived as being 

15 in the same direction as the speaker. The room impression is bound to the sound of the voice 
itself. ESI is neither spacious nor enveloping. 

When the room is large enough or reverberant enough that a substantial amount of the 
reflected energy arrives more than 50ms after the ends of the direct sound events, background 
spatial impression (BSI) is perceived. Background spatial impression arises from sound that 

20 arrives at least 50ms after the end of the direct sound. If this delayed energy is spatially diffuse 
(coming from all directions) then the fluctuations in the IID and the ITD are maximum, and a 
strong sense of envelopment results. 

Unlike ESI and CSI, BSI is not bound to the direct sound that creates it. BSI is perceived as 
separate from the direct sound, and as independent of the direct to reverberant ratio. BSI depends 

25 strongly on the absolute strength of the source, as well as on the reverberant level and the profile 
of the decay. To produce BSI it is essential to have a stream of direct sound events. BSI does not 
occur during the decay of a single sound event (or during the decay at the end of a piece of music.) 
The decay of a single sound event is itself an event - one can devote one's full perceptive powers 
to it. BSI can only be heard in the gaps between foreground events^ and has distinct properties 

30 different from foreground events. For example, if BSI has a pitch, it can have only one pitch. A 
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foreground event, such as the decay of a chord, can be heard as a combination of pitches. 

In the acoustics of small rooms, as in the acoustics of large rooms, we are primarily 
interested in BSI when we speak of envelopment. Thus we are interested in the spatial properties 
of sound which is at least 50ms delayed. In fact, the sensitivity of the ear to BSI rises to a 
5 maximum at about 160ms, after the ends of foreground sound events. 

In listening rooms there is very little energy in the room 160ms after the ends of sound 
events. Thus if we are to have BSI at all, it must be supplied by the recording. It is the reverberant 
component of a sound recording that produces BSI in a small room. FIG. 2 shows how the 
detection process works. 

10 The actual end of a sound, A, is perceived typically by the listener when the sound has 

decayed to half power or about -3dB, at B in FIG. 2. As the sound dies away, shown by the 
gradually sloping line from B to the lower right of the figure, the sensitivity of the ear to 
reflections and reverberances from other directions enveloping the listener, the background 
sensitivity, follows the line C, increasing to a maximum value D. In the region E to the left of this 

15 line, the reflections contribute to ESI and CSI, but after 50ms from the end of the event, reflections 
in the shaded area F result in BSI, with their maximum at about 1 60ms after the end of the direct 
sound. Thus, there must be a sufficiently long gap after the end of the sound to allow for 
perception of the BSI effect. 

The perception of localization fluctuations which result in ESI and CSI effects is illustrated 

20 by the curves in FIG. 3, which were obtained from a computer simulation. Spaciousness in small 
rooms where there are no long-delayed reflections is mainly due to these ITD and IID fluctuations. 
In this figure, the localization fluctuations depend strongly on the angle of a single reflection from 
the front, as shown in curve A for a frequency of 1000Hz, reaching a maximum at about 45 
degrees. The dotted curve B shows the maxima at about 22 degrees and at 90 degrees. At low 

25 frequencies, the optimum angle for a single reflection is fully to the side of the listener, at 90 
degrees. 

This angular dependence of the interaural fluctuations is a key to understanding spatial 
impression in sound reproduction systems at high frequencies. As an example, if a continuous 
broadband stereo signal such as applause is played through a standard stereo speaker system 
30 outdoors or in an anechoic chamber, and we assume the listener is centered, and the speakers 
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subtend +/-30 degrees, because the sound is continuous we expect that the spatial impression will 
be that of CSI. CSI at high frequencies is usually fully enveloping c it sounds as if it is coming 
from all around the listener. However it can be weak if the reflected energy which creates it comes 
from the medial plane. In this example we are assuming we have very little reflected energy The 
5 fluctuations, if present at all, must be generated from the direct sound. 

When we perform this experiment we perceive an image of the applause located between 
the loudspeakers. This is what we expect. However this perception is accompanied by a fully 
enveloping surround. The presence of sound that clearly comes from the sides of the listener c 
when there is no source of sound from the sides c is uncanny, On first hearing this one might 

1 0 assume that there had to be some sort of crosstalk canceler in the system, but this is not the case. 
The experiment clearly shows that it is possible for two loudspeakers at +/-30 degrees (or less) to 
produce a fully enveloping signal with no electronic or acoustic support. 

The reason is that at some frequency bands the speakers are at the optimum angle for 
producing interaural fluctuations. In our example the optimum frequencies will be about 1500Hz, 

15 which is just in the middle of our hearing system's maximum sensitivity for speech perception. 
CSI in this band will be very strongly heard. Passing the applause through a variable band pass 
filter confirms the result. Applause limited to below 500Hz produces no surround from the stereo 
pair. As the frequency is increased to the 1500Hz band a very strong surround is generated. As 
the frequency rises further the surround impression decreases, only to rise again at about 3000Hz. 

20 The theory of interaural fluctuations predicts this pattern. 

Thus a stereo pair can produce a strong sense of surround with no acoustic support from the 
room, at least at frequencies of about 1000 to 2000Hz. Such a speaker pair is also capable of 
producing BSI, if the reverberant component of the recording is fully decorrelated in the left and 
right speakers. The degree of correlation of the reverberation is important. If the reverberation is 

25 monaural the sound from the two loudspeakers will not cause the ITD and IID at the listener's ears 
to vary. No CSI or BSI will be produced regardless of frequency. In a real room there may well 
be some CSI or BSI, as lateral room reflections will contribute to the production of interaural 
fluctuations. This contribution from the room will be higher if the listener is not centered. 
However maximum spaciousness occurs when the reverberant component of the recording is fully 

30 decorrelated. In the absence of room reflections any single pair of loudspeakers can only 
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maximally excite spaciousness from a few frequency bands. If we want all frequencies to be~ 
equally spacious, we need an array of decorrelated loudspeakers. 
From these observations we can deduce three simple rules: 

For maximum spaciousness at high frequencies the reverberant component of a recording 
should be fully decorrelated. 

The frequency band for maximum spaciousness from a stereo loudspeaker pair depends on 
the angle between the loudspeakers. The wider the separation the lower the first spacious 
frequency, and the higher the perceived spaciousness. 

Ideally, the reverberant component of the recording should be reproduced by an array of 
decorrelated loudspeakers around the listener. 

The frequency dependence of the perception of spaciousness is an important reason that 
stereo reproduction works. If it not possible to perceive some spaciousness from a stereo pair 
there would be much less of a reason to switch from monaural. However two-loudspeaker stereo 
is not an optimal solution at high frequencies. Ideally we want the reflected sound field to be 
diffuse - coming from all around the listener. A diffuse sound field produces maximum interaural 
fluctuations at all frequencies, and the result is both more effective and more natural. In a typical 
room lateral reflections can help spread the reverberant energy in a recording. But if the 
reverberant component of the sound can be reproduced through multiple loudspeakers the results 
are superior c as long as the sound driving the loudspeakers is laterally decorrelated. 

The behavior of spaciousness from a stereo pair below 500Hz is much more complicated. 
In an anechoic environment spaciousness is not produced c nor are any ITDs greater than those 
produced by the front loudspeakers. However it is immediately obvious that in a typical room 
some spaciousness is audible, even at low frequencies. This spaciousness comes from fluctuations 
in the ITDs, just as it does in a concert hall. But the method that produces these fluctuating ITDs 
is very different. Below 500Hz sound travels from the loudspeakers to the listener by way of 
standing waves c the room modes. (As an academic aside, the Schroeder frequency is often 
much lower than 500Hz, but the method for producing ITDs described here works at frequencies 
above the Schroeder frequency.) 

Room modes have interesting properties. Of primary interest to us here is that they have 
constant phase. For example, assume we can drive a room with a sinusoid from a loudspeaker, 



WO 99/33173 



PCT/US98/27174 



and tune the frequency to a single mode of the room c say the lowest fundamental length mode. 
This mode is characterized by a null at the center of the room. With pressure maxima of opposite 
sign at the two ends, if we measure the phase of the pressure in the room, we will find that the 
phase is constant (except for a change in sign) throughout the room. The amplitude will vary and 
the sign changes on opposite sides of the null in the center of the room, but there is no phase shift 
of any kind as we move the test microphone around. 

This is a bizarre result. We would expect that as we move the test microphone away from 
the loudspeaker there would be a time delay in the pressure corresponding to the distance to the 
loudspeaker. This does not occur. There is no time delay at all, just a variation in the amplitude. 

This lack of time delay is audible. Localization at low frequencies is almost entirely 
determined by the ITD, or the time difference between the two ears. If the phase of the standing 
wave is constant, there can be no ITD. The standing wave is not localizable. It sounds inside the 
head. 

However, it is not entirely true that there is no ITD, even for a single standing wave. If one 
puts one f s head exactly at the null point of the standing wave the phase on either side of the head is 
opposite, and there is a phase difference of 1 80 degrees. This corresponds to a time difference of 
half the period. One would expect an odd localization shift exactly at the null point of the standing 
wave. Unfortunately it is quite difficult to hear anything at all at the null point of the standing 
wave. The pressure there is minimal. It would appear that we could perhaps hear spaciousness at 
low frequencies if we could put ourselves at the exact null point of the lateral standing waves c 
except that in that case we would hear nothing at all. 

From this we can develop two simple rules for single standing waves: 

One cannot hear spaciousness from a standing wave unless one is at a pressure null. 

One cannot hear spaciousness from a standing wave if one is at a pressure null. 

Fortunately, this is not what happens in real rooms. In real rooms there are (nearly always) 
several modes in operation at the same time. In this case the results are entirely different. 
Consider the medial modes: These are the vertical modes and the front/back modes, and all cross 
modes which do not involve the lateral direction. These modes are not by themselves capable of 
creating an ITD, as they must excite both ears of a listener identically. However they can interfere 
with the lateral modes. Most important, they can create audible pressure at the pressure nulls of a 
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lateral standing wave. The medial modes can supply enough sound pressure to make the ITD 
created by the lateral modes audible. 

The combination of a medial mode and a lateral mode can produce a very significant, and 
audible, ITD. For a maximum result the medial mode should have a +/-90 degree phase shift with 
respect to the lateral mode. Since all room modes have constant phase, it is not obvious how such 
a phase shift can arise. However, musical sounds at low frequencies are almost never precisely 
adjusted to the exact center frequencies of the room modes. When the excitation and the room 
modes are not exactly at the same frequency a phase shift results which is constant throughout the 
room. This phase shift is close to +/-90 degrees about half the time. 

FIG. 4 shows the results obtained from a Matlab program which investigates this 
interference between medial and lateral modes. The maximum ITD for a large space is shown as 
the broken line A. The medial mode has a 90 degree phase shift relative to an asymmetric lateral 
mode. When the medial amplitude is high, at OdB relative to the lateral mode (solid curve B), the 
localization fluctuations produced are fairly small. When the medial mode is 6dB lower, shown 
by the dotted curve C, it is possible for the localization fluctuations to exceed the maximum for 
large rooms. Finally, with the medial mode at -12dB, significantly larger fluctuations occur. This 
can be perceived as Aphasyness.@ 

The results of the program can be summarized as follows: 

The maximum shift in ITD results when the medial mode(s) and the lateral mode differ in 
phase by +/-90 degrees. 

The maximum shift in ITD results when the medial mode(s) is weaker than the lateral 

mode. 

The maximum shift in ITD occurs in a broad region around the null of the lateral mode. It 
is not confined to exactly the null point. 

The maximum shift in the ITD can be quite large c larger in fact than the maximum shift 
of about +/-1 ms that is created by natural hearing in large spaces. 

In a lateral standing wave there can be several null points. However, for a listener in the 
center of the lateral dimension of a room, only asymmetric lateral modes will have a null at the 
position of the listener. 

All asymmetric lateral modes will have a null along the center line of the room. 
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Asymmetric lateral modes can be created only by the antiphase component of the 
loudspeaker signals. Medial modes can be created only by the in-phase component of the 
loudspeaker signals. 

Note that for shifts in the ITD, the line of lateral symmetry in a listening room becomes a 
5 location of highly unusual significance. People who take extraordinary care in setting up their 
room to be symmetric may be in for an unusual experience. 

The following is the Matlab program used to generate the results shown in FIG. 4. In this 
listing, text following a percent (%) symbol is a comment. 
Matlab program for investigating lateral/medial interference 
10 % program to plot the weak ITD variation from interference between 
% a medial mode and a lateral mode. The phase difference between 
% two modes is p, preset to pi/2. User must type in cm, or the 
% media! amplitude relative to the lateral amplitude. Typical values 
% are 1, 0.5, and 0.25. For plotting convenience we will assume a 
1 5 % distance between the ears of 1 7cm, and a base frequency of 62.5Hz, 
% yielding a ear distance of 2 radians.% speed of sound is 1 lOOft/sec. 
% The ITD is determined by summing the medial and the lateral waves, 
% assuming constant phase within one wave, but also assuming that the 
% listener is near a pressure null (and corresponding sign flip.) 
20 % The waves are calculated in 1 .2 degree steps, and then the ITD 
% is determined by finding the angle where there is a positive 
% zero crossing 

% this program then moves the listener progressively from the null 

% position in 1.2 degree steps, and plots the decrease in the ITD 
25 % as frequency increases the ear distance in radians increases 

% linearly, and thus the phase shift increases linearly. However 

% the resulting time delay when you convert back to time turns out 

% to be closely frequency independent. Thus we will calculate these 

% effects for one frequency, 62.5 Hz. The change with distance from 
30 % the center is plotted in degrees*2. Thus the actual distance you 
move from the null is frequency-dependent, with low frequencies 

% giving very great distances, and higher frequencies less. 

% a slight modification of this program can vary p, and show that 

% the maximum ITD results when p=pi/2. 
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% define cm=medial amplitude, and df = ear delta in radians 
P = Pi/2; 

df = 0.2; % the ear distance in radians at 62.5Hz. 

% set up a for loop on the distance from the center, in half degree steps 
fory=l:90 

% calculate the left ear and right ear amplitudes of the lateral wave 

cy = 2*pi*(y-l)/720; % first the amplitude angle 

cl=sin(cy); % the left amplitude 

cr = sin(cy-dt); % the right amplitude 

a = 0:720; % calculate the waves in half degree steps 

al = pi*a/360; 

% now find the pressure at each ear by summing the lateral and medial 
1 = cl*sin(al)-cm*sin(al-p); 
r = cr*sin(al)-cm*sin(al-p); 

% find the zero crossing by first throwing away positive values 
x=l; 

if(l(x)>0) 

while (l(x)>0) 
x = x-1 

end 
end 

% now find the phase value of a positive zero crossing 
while (!(x)>0) 

ll=x 

x =x+l 
end 

% now do the same for the right ear 
x=l; 

if(r(x)>0) 
while (r(x)>0) 
x - x-1; 
end 
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end 

while (r(x) 0) 
rl =x; 
end 

5 % now find the ITD by comparing 11 and rl circularly 
% this routine finds the absolute value of the ITD 
if(ll>600&rl<50) 
diff=720-ll-rl; 
elseif(rl>600&ll<100) 
10 difT=720-rl-ll; 
else 

diff-U-rl; 
if(diff<0) 
dhT= rl-Il; 
15 end 
end 

% to make a vector of the resulting ITDs 

diffl(y)=diff; 

end 

20 

% convert to milliseconds from phase shift in half-degrees 
time « 1000*diffl/(720*62.S); 

pIot(time,'w') 
25 ylabel('Max ITD shift in milliseconds') 

xlabel('distance from room center in degrees*2) 

We localize sounds at low frequencies through the ITD. It has been shown that humans can 
localize low frequency sounds in a room. The human localization mechanism is highly dominated 
30 by the ITD of a transient, if transients exist in the source material. Transients are not corrupted by 
reflections if the room is large enough c and 1 0ms of reflection free time is enough. Thus we can 
localize a drum hit even if it has been sharply band limited to frequencies below 200Hz (or even 
60Hz). However not all bass instruments are drums. In fact, most bass instruments have a rather 
slow attack, sufficiently slow that in most playback rooms the transient is not separately detected. 
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For these instruments we must use the information conveyed by the room modes if we are to 
localize them. 

We can see from FIG. 4 that it is possible to create an ITD by coupling lateral and medial 
modes. Consider the case when a single loudspeaker of a stereo pair emits a low frequency sound, 
5 and the listener is positioned along the center line of the room. As luck would have it, the sign of 
the ITD thus created is appropriate to the direction of the loudspeaker. The magnitude of the ITD 
however depends on details of the listener position, and the coupling between modes. If the low 
frequency sound is reasonably monochromatic it is not possible to predict what ITD will be 
produced. However, on the average, some plausible localization may result. 
10 Of particular importance is the listener position in the room. If the listener is along the 

center line, then all asymmetric lateral modes will have a null. A single loudspeaker will excite 
most asymmetric modes, as well as most medial modes. There is a very good chance that an 
audible ITD will be produced, and the sound will appear to come from the correct side of the 
listener. 

15 However as can be seen in FIG. 4, if the listener is along the center line, and medial modes 

are suppressed by use of a dipole loudspeaker or acoustic treatment in the room, it is quite possible 
that the ITD will be larger than the maximum in natural hearing. The result is an unpleasant 
sensation of phasyness, even with a monaural source, namely a single loudspeaker. 

This was not perceived to be a problem in the days of monophonic playback, and is not a 

20 problem with stereophonic playback even now. The reason is that in a monophonic system, if the 
room was symmetric, the loudspeaker was also near the center line of the room. Such a 
loudspeaker cannot excite asymmetric lateral modes. In modern stereo systems we use two 
loudspeakers. They can only excite asymmetric lateral modes if they are out of phase and this 
condition has been rigorously avoided by most sound engineers. This is particularly true at low 

25 frequencies, which is why using two separately driven subwoofers such as 8 and 9 in FIG. 1 is 
often no improvement over using a single subwoofer 7 located in the middle. 

In summary, it is possible to localize low frequencies in a small room, even in the absence 
of transients. To do it one must have a signal that is uniquely in one loudspeaker or another, and 
one must be near the center line of the room. Off the center line ITDs can still occur, but their 

30 magnitude (and sometimes the sign) is random. For many forms of music, the ITDs can fluctuate. 
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The result is not localization but spaciousness. 

The next group of figures illustrates a method for assessment of the low frequency 
spaciousness by means of a weighted sum of the ITD multiplied by the pressure, and divided by 
frequency. For these examples we sum over the frequency range from 30Hz to 300Hz. 
5 Typical listening rooms have dimensions of about 1 0 to 25 feet, the walls, floor and ceiling 

being quite reflective at low frequencies, with absorption in the 10% range. This results in 
multiple reflections with little energy being absorbed, creating standing wave patterns which have 
pressure peaks and dips in various parts of the room. Low frequency sound propagates from the 
speakers to the listener through these standing waves. While the pressure at the listener position is 
10 augmented by the standing waves, this augmentation is not uniform with frequency, in general 
having many peaks and dips in the frequency response. 

The modal patterns of small rooms were studied extensively by Sabine, Morse, Beranek and 
others. However, we currently are more interested in the effects produced in small rooms by a pair 
of loudspeakers rather than the pressure produced by a single loudspeaker in a room. 
15 We first calculate the ITDs generated by a single loudspeaker placed 2' from a corner along 

the front 1 2' wall of a room 1 5' long, where the listener is & from the front wall and at various 
distances from the 15* wall nearest to the loudspeaker. The method will be described in more 
detail later. 

FIG. 5 shows the results for a room with wall reflectivity of 0.9. The solid curve A shows 
20 the absolute ITD as a function of listener position (at intervals of 1 .5 M .) The dotted curve B shows 
the net ITD for the same locations. For these purposes, the absolute and net ITDs are defined by 
the following method: 
Net_itd_sum = sum( ITD * pressure / frequency) 
Absolute_itd_sum = sum(abs(ITD) * pressure / frequency) 
25 Reference = sum(pressure/frequency) 
Netjtd = Net Jtd_sum/reference 
Absoluteitd = AbsoIute_itd_sum/reference 

where the measured or computed ITD at each frequency is combined and weighted as indicated. 
The ITD can be computed from the long FFT of the impulse response at each ear, as will be 
30 described below. 

In FIG. 5, the absolute and net ITDs (A and B respectively) are similarly shaped and are 
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high. In FIG. 6, the reflectivity of the side walls of the same room has been reduced to 0.7, with a 
dramatic effect on the absolute ITD curve A, which has been reduced substantially. In FIG. 7, the 
reflectivity of the front and back walls of the same room has been reduced to 0.6, with side wall 
reflectivity of 0.9. This results in an increase of both the absolute and net ITDs (A and B 
respectively.) 

FIG. 8 shows the ITDs for rooms with different widths, all 15' long. The single speaker is 
4' from the center line of the room (see 4 in FIG. 1 .) The listener is centered and is 6.9' from the 
speaker (about in the same position as in FIGs. 5-7.) A large peak in the net ITD curve B 
coincides with the Agolden ratio@ of 1 .6:1, at about 10' width, while the absolute ITD curve A 
peaks at US-IT. 

FIG. 9 shows the same length room, but with larger widths from 14' to 20'. There is a peak 
at 15' corresponding to a square room, but after this, both absolute ITD curve A and net ITD curve 
B fall off with increasing width. A corollary is that spaciousness at low frequencies is reduced 
when the speakers are placed on the long wall of the room. 

If we excite the same room as used in FIG. 7 with two loudspeakers, one in each front 
corner, with the listener 6' from the front wall, with the signals in phase, it is not possible to excite 
odd order asymmetric lateral modes, including the lowest frequency lateral mode. Only the even 
order modes can be excited. When the speakers are driven in antiphase, the odd order modes can 
be excited, but not the even order modes. 

FIG. 10 shows the ITDs for the in-phase case, and FIG. 1 1 shows them for the antiphase 
case. In the first case, the ITDs are very low, and the low frequencies do not seem to fill the room, 
but localize inside the listener's head. In the second case, the absolute ITD becomes very high for 
a listener near the center line of the room, and the result is extreme phasyness, which is 
uncomfortable for most people. 

Because only half of the lateral room modes are excited in either case, the frequency 
response is more uneven than when all the room modes are driven, as if by a single loudspeaker. 
For these reasons, it has been recommended that when two subwoofers are used, they should be 
placed together in one corner of the room. This is not possible with full-range loudspeakers. 

Before describing the invention which has been made to solve this problem, we will look at 
the assumptions and methods used to calculate the ITDs in the previous discussion. FIG. 12 
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shows a listening room 1, with a listener 2 situated at distances P Y from the left wall and P v from 
the front wall. A single loudspeaker 10 is situated at S x from the left wall and S y from the front 
wall. Assuming that the walls are highly reflective, the side wail reflectivity being aw and the 
front and back walls having reflectivity al, there is a reflection in the left side wall that produces an 
apparent image 12 of the speaker (shown dotted.) Sounds reach the listener directly from the 
speaker via path 14 and from the image via path 16. These two different paths are of different 
lengths, and there are many other paths which can be excited by the multiple images of the 
loudspeaker produced by the reflections in the various walls of the room. Each of these reflections 
is attenuated by varying amounts, and is delayed by various times. 

If the loudspeaker is driven with a very short impulsive sound, each reflected impulse being 
plotted at its arrival time and with the amplitude, dependent on its attenuation, we obtain the 
impulse responses for the left and right ears, as shown in FIG. 1 2. These are slightly different, as 
the speaker is not placed symmetrically in the room. 

In FIG. 13, the phase difference (solid curve A) and interaural time difference (dotted curve 
B) are shown as a function of frequency, these being derived from the impulse responses as in the 
following Matlab program, using a long FFT: 

FTL = FFT(left impulse response) 
FTR » FFT(right impulse response) 

phase Jeft = arctan(imag(FTL)/reaI(FTL)) 
phase_right = arctan(imag(FTR)/real(FTR)) 

forxl = 1:500 

phase_diff(xl) - phasejeft(fftl(xl)) - phase_right(fftr(xl)); 
ifphase_diff(xl)>0 
if phase_dif¥(xl) > pi 

phase_diff(xl) = phasej3iff(xl)-2*pi; 

end 
else 

if phase-diff(xl) < -pi 

phase_diff(xl) - phase_diff(xi)+2*pi; 
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end 

end 
end 

delfrq - 5000/4096 

deltime = phase_difT./(2*pi*delfrq*n) 

The object of this invention is to enable both loudspeakers in the room to excite all of the room 
modes. To do this, the speakers cannot be driven in phase, as they will then fail to excite the odd 
lateral room modes, but neither can they be driven in antiphase, which would not allow the even 
modes to be excited. There is, however, an alternative. If the speakers are excited in a quadrature 
or 90 degree phase relationship, the sounds from each loudspeaker are orthogonal in the wave 
space and each speaker therefore excites all the room modes independently. Only when the 
symmetry is imperfect can some of the modes be canceled between the speakers, or if the phase 
difference is not exactly 90 degrees. 

Constant phase differencing networks are well known in the art, and are usually designed to 
approximate a quadrature phase difference between the left and right output over a given range of 
frequencies. For a single decade of frequencies, in this case between 20Hz and 200Hz, the phase 
difference may be made nearly constant using only one pole in each of a pair of all-pass phase 
shifters. More complex arrangements can be made, using two or more poles per network. The 
advantage of additional poles would be to reduce the inevitable phase difference at frequencies 
outside the band of interest. Ideally, the phase difference should be essentially zero above about 
400Hz. 

According to the invention, the circuit 30 of FIG. 15 includes a phase shifter comprising op- 
amp Ul, resistors R1-R3 and capacitor CI , this being driven from a first input terminal 32. A 
second phase shifter comprising op-amp U3, resistors R6-R8 and capacitor C3, is driven from the 
second input terminal 34. With the values shown, the signal in the first channel is transmitted 
through Ul with a phase inversion at low frequencies, but at high frequencies the signal is passed 
in phase to the output. At the natural frequency associated with the C1-R2 time constant, the 
phase shift is 90 degrees. The amplitude response is exactly unity gain for all audio frequencies 
(subject to tolerances of the resistors.) Similarly, for the second channel, the phase shift becomes 
90 degrees at a lower frequency than that in the first channel. Between these two frequencies, and 
for some little range outside them, the phase shift between the two outputs remains at or close to 
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90 degrees. 

Thus the circuit of FIG. 15 provides two outputs from the op-amps Ul and U3 that are in 
quadrature phase relationship over a significant range of frequencies. 

Because the room modes are all being excited by both loudspeakers, there is a considerable 
5 increase in the bass energy in the room. As the loudspeakers are normally expected to be used in a 
phase-coherent relationship and are designed to give a flat bass response in this mode of operation, 
when their sound pressure levels sum in phase, the use of a quadrature phase between them at low 
frequencies will reduce the coherent bass output level by 3dB . Therefore, it is necessary to re- 
equalize the room response by performing a 3dB boost at the lower frequencies, in both channels. 

10 In the first channel, the output signal from op-amp Ul is passed to the boost stage 

comprising op-amp U2, resistors R4 and R5, and capacitor C2. At high frequencies, C2 has low 
impedance, and the gain through this stage is unity (OdB). At low frequencies, the gain is (R5 / R4 
+ 1), or 1.34, approximately 3dB. The roll-off frequency is chosen to match the effect of the 
increasing phase difference between the two channels in the transition region between about 

15 400Hz and 200Hz. The output of this boost stage is applied to a first output terminal 36 of the 
circuit 30. 

An identical boost stage comprising op-amp U4, resistors R9 and RIO, and capacitor C4, is 
placed in the second channel to receive the signal from op-amp U3, and the output of op-amp U4 
is applied to the second output terminal 38 of circuit 30 of FIG. 1 5. 
20 FIG. 16 shows the frequency response and the phase difference plotted against frequency, 

and also the amplitude produced when the channels are summed together. Curve A shows the 
output amplitude, for IV AC input, which is 1.34V at low frequencies, 

and falls off to 1 .09 at 2kHz (asymptotic to 1 .0V at high frequencies. Both channels have identical 
frequency response, as the delay stage is all-pass and only the boost stage has additional low 

25 frequency gain. 

Curve B shows the phase difference between the outputs, as simulated using PSPICE. 
Below 1 50Hz the phase remains within about " 12e of the desired 90e phase shift. Above 
300Hz, where the phase difference is 45E,the phase difference falls rapidly, and becomes about 
30e at 500Hz and below 20e at 1000Hz. 

30 The effect of the boost is shown in curve C, which is the sum of the voltages applied to the 
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two loudspeakers, which approximately represents the low frequency sound pressure level in the 
room. This sum is 2.0V at all high frequencies, and varies from this value by 11 0.34V in the low 
frequency region where the phase shift networks and the boost circuits are operational. Thus the 
bass is equalized to the same level as for coherent phase loudspeakers. Since more bass energy is 
5 coupled to the room modes, the actual bass level in the room may be higher, off the center line of 
the room. 

The purpose of this invention is to produce the same increase in perceived spaciousness and 
externalization for a single channel source, or for. a recording where the low frequency energy has 
been recorded in monaural in both stereo channels. These recordings are the rule for popular 

10 music. Adding the phase shift does not reduce the strength of the spaciousness for true stereo 

material, it only increases the effect for monaural material. A method of predicting the increase in 
spaciousness based on the properties of the room modes is presented in the reference quoted 
below. A complete mathematical formulation of this theory, which can be used to support the 
claimed increase in spaciousness from the 90 degree phase shift, was presented orally at the AES 

1 5 convention mentioned in the reference. However the use of a phase shift for this purpose was 
deliberately not mentioned in the talk. 

This increase in spaciousness has been noticed before by many listeners using a standard 
full range system when a recording is played which includes a lot of reverberation at low 
frequencies. The reverberation (which tends to have a rapidly varying random phase relationship) 

20 produces a non-zero phase relationship between the two loudspeakers, and this effectively excites 
the asymmetric modes. In this case the low frequencies seem to be located around the listener, not 
inside the head. 

As mentioned previously, this is not the only possible embodiment of the invention. It 
would be possible to use four or six-pole phase difference networks to obtain tighter control of the 
25 phase shift in the bass. 

As mentioned in the background section above, during testing of the invention it became 
clear that there was some advantage to reducing the phase shift to zero for the duration of sharply 
rising low frequency transients. There is also reason to believe that direction sensing circuitry 
such as is used in matrix surround technology can be used to choose the channel to which the lead 
30 in phase should be applied. Simply detecting which channel was louder and causing the phase 
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shift to lead in this direction will improve low frequency localization in most systems. It will also 
improve envelopment. When neither channel leads on the average, it is probably best to choose the 
right channel as the lead channel. 

The nature of the circuitry shown in FIG. 15 also lends itself to use within a pair of powered 
loudspeaker systems each having within the loudspeaker enclosure either the first or second 
channel circuitry of FIG. 15 and an audio power amplifier for driving the loudspeaker. 

In other embodiments it is contemplated that the phase relationship may be varied to 
produce the desired phase degree differences mechanically. For example, the phase relationship 
varying may be accomplished by mechanical manipulations of the loudspeaker enclosure volume, 
driver cone mass, port area and geometry, and crossover design, that combine to produce the 
desired 90 degree phase difference in the pressure output of the loudspeakers. 

While the preferred embodiments of the invention have been described and illustrated 
herein, many other possible embodiments exist, and these and other modifications and variations 
will be apparent to those skilled in the art, without departing from the spirit of the invention. 
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1 1 . A system for maintaining a constant phase difference between the pressure outputs of 

2 loudspeakers in a listening room, including a signal processor for driving first and second power 

3 amplifiers and first and second loudspeaker systems in a listening room, the system comprising: 

4 first and second input terminals adapted to receive a stereophonic pair of audio input 

5 signals; 

6 corresponding first and second output terminals for producing a stereophonic pair of output 

7 signals for driving the first and second power amplifiers and the first and second loudspeaker 

8 systems; 

9 circuitry for varying the phase relationship between the first and second output signals such 

10 that their phase difference tends towards zero at high audio frequencies and increases to 

1 1 approximately quadrature or 90 degrees phase difference at low frequencies, the gain of the 

12 circuitry between each the input terminal and the corresponding output terminal being 

13 approximately constant at all audio frequencies, thereby providing increased apparent 

14 spaciousness of low frequency sounds reproduced by the loudspeakers in the listening room. 

1 2. The signal processing system of claim 1 further comprising: 

2 a circuit between each input terminal and the corresponding output terminal wherein there is 

3 an equal bass boost applied to both the first and second output signals of approximately 3dB for 

4 those frequencies where the phase difference is approximately 90 degrees, thereby maintaining 

5 approximately constant sound pressure at all frequencies along the lateral center line from front to 

6 back of the listening room. 

1 3. The signal processing system of claim 1 wherein said approximately quadrature phase 

2 relationship between the first and second output signals is maintained over a frequency range from 

3 approximately 20Hz to 200Hz, reducing to zero gradually as the frequency increases above 

4 200Hz. 

1 4. The signal processing system of claim 1 wherein the portion of the circuitry between the 

2 first input terminal and said first output terminal is contained within the said first loudspeaker 
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3 system enclosure along with the first power amplifier, and the portion of the circuitry between the 

4 second input terminal and the second output terminal is contained within the second loudspeaker 

5 system enclosure along with the second power amplifier. 

1 5. The signal processing system of claim 1 further comprising: 

2 circuitry for detecting the presence of rapidly increasing low frequency signal transient 

3 components; and 

4 circuitry for temporarily reducing the phase shift between the first and second output signals 

5 to zero for the duration of said low frequency transients. 

1 6. The signal processing system of claim 1 further comprising: 

2 circuitry for detecting the average amplitude of low frequency audio signals in each of the 

3 first and second input audio signals; and 

4 circuitry for transmitting the one of the first and second audio input signals having the 

5 larger low frequency amplitude through the portion of the signal processing circuitry having 

6 leading phase relative to the other portion thereof to the corresponding first or second output 

7 terminal, the signal from the other of said input terminals being transmitted through the other 

8 portion of the signal processing circuitry to the corresponding output terminal. 

1 7. The signal processing system of claim 1 wherein the first audio input signal is identified 

2 with the left channel signal of a stereophonic pair of audio signals and the second audio input 

3 signal is identified with the right channel signal of said stereophonic pair of audio signals. 

1 8. The signal processing system of claim 7 wherein the leading phase one of the first and 

2 second all-pass phase shift circuits is applied to the right channel signal of the stereophonic pair of 

3 signals and the lagging phase all-pass phase shift circuit is applied to the left channel signal. 

1 9. The signal processing system of claim 1 comprising: 

2 first and second input terminals for receiving the first and second audio input signals 

3 respectively; 
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4 first and second all-pass phase shift circuits connected to the corresponding first and second 

5 input terminals for providing a relative phase shift between their outputs which is approximately in 

6 quadrature over a specified band of low audio frequencies and tends to zero at higher audio 

7 frequencies while having a constant gain at all audio frequencies; and 

8 first and second output terminals connected respectively to the outputs of the first and 

9 second all-pass phase shift circuits. 

1 1 0. The signal processing system of claim 9 further comprising: 

2 first and second bass boost circuits interposed between the outputs of the first and second 

3 all-pass phase shift circuits and the corresponding first and second output terminals and providing 

4 a boost of approximately 3dB at low frequencies where the phase relationship between the outputs 

5 of the first and second all-pass phase shift circuits is approximately in quadrature, falling to unity 

6 at higher audio frequencies where the phase difference between them is approximately zero. 

1 11. The signal processing system of claim 9 further comprising: 

2 means for detecting the onset of low frequency transients having equal amplitudes in both 

3 of the first and second input signals; and 

4 means for temporarily reducing the relative low frequency phase shift between the first and 

5 second output signals to zero for the duration of such low frequency transients. 

1 12. The signal processing system of claim 9 further comprising: 

2 means for detecting which of the first and second audio input signals contains the higher 

3 average amplitude of low frequency audio signals; 

4 means for connecting the one of the audio input signals having the higher average low- 

5 frequency amplitude to the leading phase one of the all-pass phase shift circuits while connecting 

6 the other audio input signal to the lagging phase all-pass phase shift circuit; and 

7 means for connecting the output of the leading phase all-pass phase shift circuit to the 

8 output terminal corresponding to the audio input signal having the higher average low frequency 

9 amplitude and the output of the lagging phase all-pass phase shift circuit to the other of said first 

1 0 and second output terminals. 
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1 13. The signal processing system of claim 9 wherein the first audio input signal is identified 

2 with the left channel signal of a stereophonic pair of audio signals and the second audio input 

3 signal is identified with the right channel signal of the stereophonic pair of audio signals. 

1 14 The signal processing system of claim 9 wherein the leading phase one of the first and 

2 second all-pass phase shift circuits is applied to the right channel signal of the stereophonic pair of 

3 signals and the lagging phase all-pass phase shift circuit is applied to the left channel signal. 

1 15. A method for increasing the apparent spaciousness of low frequency sounds reproduced on 

2 a stereophonic pair of left and right loudspeakers in a listening room comprising the steps of: 

3 applying a varying relative phase shift between the left and right loudspeaker output signals 

4 such that their relative phase difference is approximately zero at high frequencies and 

5 approximately in quadrature at low frequencies but their relative amplitudes are maintained equal 

6 at all frequencies; 

7 locating the left and right loudspeakers with lateral symmetry in left front and right front 

8 comers of the listening room; 

9 thereby providing independent excitation of all low frequency modes of the listening room 
10 by both of said left and right loudspeakers. 

1 1 6. The method of claim 1 5 further comprising the step of: 

2 applying a bass boost of approximately 3dB at frequencies where the relative phase 

3 difference between said left and right loudspeaker output signals is in quadrature, falling to OdB at 

4 frequencies where the left and right loudspeaker output signals are in phase. 

1 17. The method of claim 1 5 further comprising the steps of: 

2 detecting the presence of low frequency transient signals which are approximately equal in 

3 the left and right audio input signals; and 

4 temporarily removing the relative phase shift between the left and right loudspeaker output 

5 signals for the duration of such low frequency transient signals. 
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1 18. The method of claim 1 5 further comprising the steps of: 

2 determining the relative average level of low frequency components of the left and right 

3 audio input signals; 

4 applying the signal having the larger average level of low frequency components through 

5 the channel having leading phase to the loudspeaker corresponding to the signal having the larger 

6 average level of low frequency components; and 

7 applying the signal having the smaller average low frequency content through the channel 

8 with lagging phase to the other loudspeaker of the pair. 

1 19. A system for maintaining a constant phase difference between the pressure outputs of 

2 loudspeakers in a listening room, comprising in the electronic case of a signal processing system 

3 for driving first and second power amplifiers an first and second loudspeaker systems in a listening 

4 room, comprising: 

5 first and second input terminals adapted to receive a stereophonic pair of audio input 

6 signals; 

7 corresponding first and second output terminals for producing a stereophonic pair of output 

s 

8 signals for driving the first and second power amplifiers and the first and second loudspeaker 

9 systems; and 

1 0 means for varying the phase relationship between the first and second output signals such 

1 1 that their phase difference tends towards zero at high audio frequencies and increases to 

12 approximately 90 degrees phase difference at low frequencies, the gain of the circuitry between 

13 each input terminal and the corresponding output terminal being approximately constant at all 

14 audio frequencies, thereby providing increased apparent spaciousness of low frequency sounds 

15 reproduced by said loudspeakers in the listening room. 

1 20. The system of claim 1 9 wherein the phase relationship varying means comprises 

2 mechanical manipulations of the loudspeaker enclosure volume, driver cone mass, port area and 

3 geometry, and crossover design, that combine to produce the desired 90 degree phase difference in 

4 the pressure output of the loudspeakers. 



-29- 



WO 99/33173 



PCT/US98/27174 



1/10 




FIG. / 



WO 99/33173 



PCT/US98/27174 




WO 99/33173 



PCT/US98/27174 




WO 99/33173 



PCT/US98/27174 




WO 99/33173 



PCT/US98/27174 



5/10 




17 18 
room width, foet 



WO 99/33173 



PCT/US98/27174 




WO 99/33173 



PCT/US98/27174 



7/10 



W 




•Pi H 



•eflectivity 



ai 



reflectivity - oi 



FIG. 12 



WO 99/33173 



PCT/US98/27174 




WO 99/33173 



9/10 



PCT/US98/27174 



30 




R8 38 




FIG. 15 



WO 99/33173 



PCT/US98/27174 



10/10 




INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/27174 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC (6) :H03G 3/00 
US CL : 38 1/97 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 381/97, 1. 98 



Documentation searched other than rninimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant 



Relevant to claim No. 



A 
A 



US 5,233,665 A (VAUGHN et al) 03 August 1993, fig. 1. 
US 5,301,237 A (FOSGATE) 05 April 1994, fig. 4. 



1-20 
1-20 



□ 

Further documents are listed in the continuation of Box C. See patent family annex. 



Special categories of cited document*: 

A* document defining the general state of the art which is not considered 

to be of particular relevance 

E* earlier document published on or after the iniemaiional Tiling date 

L* document which may throw doubts on priority claim(i) or which is 

cited lo establish the publication date of another citauon or other 
special reason tas specified) 

O" document referring to an oral disclosure, use, exhibition or other 

means 

* P* document published prior to the international Tiling date but later than 
the priority date claimed 



later document published after the international Gling dale or priority 
data and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invenuon cannot be 
considered novel or cannot be considered to involve en inventive step 
when the document is taken alone 

document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when ihe document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 



document member of the same patent family 



Date of the actual completion of the international search 



12 FEBRUARY 1999 



Name and mailing address of the ISA/ US 
Commissioner o| w patents and Trademarks 
Box PCT 

Washington. D.C. 20231 
Facsimile No. (703) 305-3230 



Form PCT/ISA/210 (second sheei)(July 1992) : 



Date of mailing of the international search report 

07 APR 1999 



Authorized officer 

VIVIAN CHANG 
Telephone No. (703) 308- 




